PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopim12g005830.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 752aa    MW: 82738.9 Da    PI: 4.9578
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopim12g005830.0.1genomeCSHLView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox67.12.3e-2178134157
                         TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
            Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57 
                         r++ +++t++q++eLe++F+ +++p+ ++r+eL+++lgL+  qVk+WFqN+R+++k+
  Sopim12g005830.0.1  78 RKRYHRHTQHQIQELEAYFKDCPHPDDKQRKELSRELGLEPLQVKFWFQNKRTQMKN 134
                         789999************************************************995 PP

2START182.23e-572744904205
                         HHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEE CS
               START   4 eeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaet 81 
                          +a++e+v++a+ +ep+W+ s      ++n+de+ + f+++ +     + +ea r+s +v+m+  +lv++l++++  W   +     +a+ 
  Sopim12g005830.0.1 274 MSAMDEFVQMAYIQEPLWSPSIenaaFMLNEDEYFRVFPRGIGprpigFVTEATRDSSIVIMNKLDLVQILMNVN-HWASIFSgivsRASC 363
                         579*******************999999************999********************************.*************** PP

                         EEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEE CS
               START  82 levissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvt 165
                         ++vis+g      g++q++ ae+q+ sp vp R+f+f+Ry++   +g+w+ivdvS+d+ +  p      R++++pSg+lie+ +ng skvt
  Sopim12g005830.0.1 364 VDVISTGgagnpnGVIQVILAETQVASPQVPtREFYFARYCKLRVDGTWAIVDVSLDQLRAGPA----LRSRKRPSGCLIEDAPNGCSKVT 450
                         ***************************************************************5....*********************** PP

                         EEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
               START 166 wvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                         wvehv+++++++h ++++lv+sgla+gak+w+a+l+rqc 
  Sopim12g005830.0.1 451 WVEHVEVDASTVHTIYKPLVSSGLAFGAKRWLAALDRQCD 490
                         **************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.603.7E-2262133IPR009057Homeodomain-like
SuperFamilySSF466897.94E-2065135IPR009057Homeodomain-like
PROSITE profilePS5007117.45975135IPR001356Homeobox domain
SMARTSM003892.7E-2176139IPR001356Homeobox domain
CDDcd000864.72E-2077135No hitNo description
PfamPF000466.9E-1978133IPR001356Homeobox domain
PROSITE patternPS000270110133IPR017970Homeobox, conserved site
PROSITE profilePS5084839.991262494IPR002913START domain
SuperFamilySSF559612.09E-27264493No hitNo description
CDDcd088753.96E-107266489No hitNo description
SMARTSM002341.4E-55271491IPR002913START domain
PfamPF018523.5E-49274490IPR002913START domain
Gene3DG3DSA:3.30.530.201.4E-5357485IPR023393START-like domain
SuperFamilySSF559612.53E-16510743No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 752 aa     Download sequence    Send to blast
MPAVHSRPAN LVPPLVENDG TNDTEIPLGQ PSNMMDNQPN SMENVTNTLD TSIGQDLGGG  60
DDDHNDDQEE GETATGKRKR YHRHTQHQIQ ELEAYFKDCP HPDDKQRKEL SRELGLEPLQ  120
VKFWFQNKRT QMKNQLERHE NTHLRIEIER MKSEMLRLKE ALGNLSCPTC GGHRINIGDL  180
SFEERQLRSE NARLKEEIER ISSMAGRVVG KSVFPNNNNG YISAPPVDFG VLPAYSMPQR  240
VDGNGGFEDV PIMSPMIIPP ISEIEKPFII ELVMSAMDEF VQMAYIQEPL WSPSIENAAF  300
MLNEDEYFRV FPRGIGPRPI GFVTEATRDS SIVIMNKLDL VQILMNVNHW ASIFSGIVSR  360
ASCVDVISTG GAGNPNGVIQ VILAETQVAS PQVPTREFYF ARYCKLRVDG TWAIVDVSLD  420
QLRAGPALRS RKRPSGCLIE DAPNGCSKVT WVEHVEVDAS TVHTIYKPLV SSGLAFGAKR  480
WLAALDRQCD RIASVLVTPT LNNDDMLLTT EESRKNVLKL AERMINCFSS GVSATVGNQW  540
TTVSAGNGTD ENVRIMTRKT VGDPSRPPGI VLSAATSFWL PIAPKTVFDY LRNENTRIEW  600
DILSNGSPLQ EIVQIANGTQ TGNCVSLLRM IGENDNQSNI VILQENSTDP TGSYVIYAPV  660
EICTMNTILN GGDPEHVSLL PSGFAILPDG PPGVRGTYNH GSGGSLLTVA FQILVEFVPV  720
ANISLGSVAT VNNLIACTIE RIKAALLPQP T*
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755240.0HG975524.1 Solanum lycopersicum chromosome ch12, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004252035.10.0PREDICTED: homeobox-leucine zipper protein HDG2-like
SwissprotQ6ZAR00.0ROC1_ORYSJ; Homeobox-leucine zipper protein ROC1
TrEMBLK4DBC10.0K4DBC1_SOLLC; Uncharacterized protein
STRINGSolyc12g005830.1.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA9322491
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G05230.40.0homeodomain GLABROUS 2